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CHIMERIC POLYPEPTIDES OF SERUM ALBUMIN AND USES RELATED THERETO 

This application is a continuation-in-part of U.S. Application No. 09/619,285, filed July 
19, 2000, which is based on U.S. Provisional Application No. 60/144,534, filed July 19, 1999, the 
specification of which is hereby incorporated by reference in its entirety. 

Background of the Invention 

Recent advances in recombinant DNA technology have made available a wide range of 
biologically active peptides. Although in some instances molecular remodeling, for instance by 
ligated gene fusion or by site directed mutagenesis, has endowed such proteins with properties 
compatible with optimal activity, it is generally the case that effective use of these products can 
only be achieved through delivery systems. 

Polypeptide therapeutic agents, despite their promise in a number of disease treatments, 
are readily decomposed by gastric juices and by intestinal proteinases such as pepsin and trypsin. 
As a result, when these polypeptides are orally administered, they are barely absorbed and produce 
no effective pharmacological action. In order to obtain the desired biological activity, the 
polypeptides are at present usually dispensed in injectable dosage forms. However, the injectable 
route is inconvenient and painful to the patient, particularly when administration must occur on a 
regular and frequent basis. Consequently, efforts have focused recently on alternative methods jor^ 
administrMion of "such polypeptides. 

Such agents usually exhibit a short half-life in the circulation, being rapidly excreted 
through the kidneys or taken up by the reticuloendothelial system (RES) and other tissues. To 
compensate for such premature drug loss, larger doses are required so that sufficient amounts of 
drug can concentrate in areas in need of treatment. However, this is not only costly; it can also 
lead to toxicity and an immune response to the foreign protein. Sustained-release formulations 
(Putney, S.D. et al. Nature Biotechnology 1998, 75, 153-157) generally reduce the necessary 
dosage, but still depend on injection or more objectionable forms of delivery. A therapeutic 
protein with a longer half-life in the body would maintain a more stable blood level in much the 
same way as a sustained-release formulation, but would not entail the difficulties of preparing a 
sustained-release formulation and would require an even lower dosage because it is destroyed less 



quickly. For instance, cytokines such as interferon (IFN-gamma) and interleukin-2 (IL-2) would 
be more effective, less toxic and could be used in smaller quantities, if their presence in the 
circulation could be extended. 

Summary of the Invention 

One aspect of the present invention provides a chimeric polypeptide comprising a 
biologically active heterologous peptide fragment inserted into a serum albumin protein or a 
homolog thereof. The heterologous peptide fragment may optionally replace a portion of the 
serum albumin protein sequence. A peptide fragment which replaces a portion of the serum 
albumin protein sequence need not be of the same length as the fragment it replaces. A chimeric 
polypeptide according to this aspect may include more than one heterologous peptide fragment 
which replaces a portion of the serum albumin protein sequence. The included fragments may be 
identical, may be distinct sequences from a protein unrelated to serum albumin protein, or may be 
distinct sequences of unrelated origin. 

A chimeric polypeptide of this aspect, for example, may comprise the structure A-B-C, 
wherein A represents a first fragment of a serum albumin protein or homolog thereof, B represents 
a biologically active heterologous peptide sequence, and C represents another fragment of a serum 
albumin protein or a homolog thereof. Similarly, a chimeric polypeptide may comprise the 
structure A-B-C-D-E, wherein A, C, and E represent fragments of a serum albumin j^teinand B 
— and-D represent identicarbiorogicMly active heterologous peptide sequences, two different 
biologically active sequences of a protein unrelated to serum albumin protein, or two different 
biologically active sequences of two different proteins unrelated to serum albumin protein. 
Analogously, a chimeric polypeptide may comprise the structure A-B-C-D-E-F-G, wherein A, C, 
E, and G represent fragments of a serum albumin protein and B, D, and F represent identical 
biologically active heterologous peptide sequences, at least two different biologically active 
sequences of a protein unrelated to serum albumin protein, or at least two different biologically 
active sequences of two different proteins unrelated to serum albumin protein. In certain 
embodiments, a peptide fragment of serum albumin or a heterologous peptide sequence includes 
at least 6 amino acids, at least 12 amino acids, or at least 18 amino acids. 



A chimeric polypeptide may comprise the structure (A-B-C) n , e.g., -HN-(A-B-C) n -CO- or 
H 2 N-(A-B-C) n -C0 2 H, wherein A, independently for each occurrence, represents a fragment of 
serum albumin (SA), B, independently for each occurrence, represents a biologically active 
heterologous peptide sequence, C, independently for each occurrence, represents a second 
biologically active heterologous peptide sequence or a fragment of serum albumin (SA), and n is 
an integer greater than 0. In certain embodiments, a peptide fragment of serum albumin or a 
heterologous peptide sequence includes at least 6 amino acids, at least 12 amino acids, or at least 
18 amino acids. 

Alternatively, such a chimeric polypeptide may comprise an N-terminal fragment of a 
serum albumin protein or a homolog thereof, a biologically active heterologous peptide sequence, 
and a C-terminal fragment of a serum albumin protein or a homolog thereof. The heterologous 
peptide sequence may be between about 3 and about 500 or between about 4 and about 400 
residues in length, preferably between about 4 and about 200 residues, more preferably between 
about 4 and 100 residues, and most preferably between about 4 and about 20 residues. 

In one embodiment, the chimeric polypeptide has a half-life in the blood no less than 10 
days, preferably no less than about 14 days, and most preferably no less than 50% of the half-life 
of the native serum albumin protein or homolog thereof. 

In another embodiment, the heterologous peptide sequence is capable of binding to a cell 

surface receptor protein. Examples of such a receptor protein include a G protein-coupled 

"receptor, a tyrosine kinase receptor, a cytokine receptor, an MIRR receptor, and an orphan 
receptor. 

In another embodiment, the chimeric polypeptide is capable of binding to an extracellular 
receptor or ion channel. The chimeric polypeptide may be an agonist or an antagonist of an 
extracellular receptor or ion channel. The chimeric polypeptide of this embodiment may, for 
example, induce apoptosis, modulate cell proliferation, or modulate differentiation of cell types. 

The invention also comprises a nucleic acid sequence which encodes a chimeric 
polypeptide as described above. 

The invention further comprises a delivery vector, such as a viral or retroviral vector 
comprising a nucleic acid sequence encoding the chimeric polypeptide. Suitable vectors may 



include, for example, an adenovirus, an adeno-associated virus, a herpes simplex virus, a human 
immunodeficiency viruses, or a vaccinia virus. 

The invention also comprises a phaiWceutical composition comprising a chimeric 
polypeptide as described above, and methods for treating a disease in an organism by 
administering an effective dose of such a pharmaceutical composition to the organism. In a 
currently preferred embodiment, a chimeric polypeptide according to the invention comprises a 
fragment of an angiogenesis-inhibiting protein, sucn\as angiostatin or endostatin, as the 
heterologous peptide sequence and is capable of inhibiting angiogenesis. For example, a peptide 
fragment that inhibits angiogenesis and which may be incorporated into a subject polypeptide is 
RGD (Arg-Gly-Asp), or a sequence which includes the sequence RGD (e.g., VRGDF). Analogous 
methods maybe used to modulate conditions such as cell proliferation, cell differentiation, and 
cell death. 

In a currently preferred embodiment, the present invention provides a method of treating a 
disease in an organism by introducing into cells of the organism genetic material encoding a 
chimeric polypeptide protein comprising serum albumin protein or segments thereof and one or 
more therapeutic proteins or polypeptides or fragments thereof, such that the introduced genetic 
material is expressed by the transfected cells of the organism. Analogous methods may be used to 
modulate conditions such as cell proliferation, cell differentiation, and cell death. 

In another aspect, the present invention provides a method for treating a d^asejnan 

—organism by introducing g^tic'material encoding a chimeric polypeptide comprising serum 
albumin protein or segments thereof and one or more therapeutic proteins or polypeptides or 
fragments thereof into target cells ex vivo under conditions sufficient to cause the genetic material 
to be incorporated into the cell, thereby causing the cell to express the genetic material encoding 
said proteins or polypeptides. The target cells are then introduced into the host organism such that 
the introduced genetic material encoding said proteins or polypeptides is expressed by the target 
cells in the organism. The target cells may be selected from the group consisting of blood cells, 
skeletal muscle cells, smooth muscle cells, stem cells, skin cells, liver cells, secretory gland cells, 
hematopoietic cells, and marrow cells. 



Another aspect of the present invention provides transfected cells comprising target cells 
which have been exposed to a delivery vector comprising a nucleic acid encoding the chimeric 
protein or polypeptide of this invention. These cells are preferably selected from the group 
consisting of blood cells, skeletal muscle cells, smooth muscle cells, stem cells, skin cells, liver 
cells, secretory gland cells, hematopoietic cells, and marrow cells. 

Brief Description of the Figures 

Figure 1 shows the tertiary structure of human serum albumin (HSA). 

Figure 2 illustrates the transfection of cells with mouse serum albumin (MSA)-Myc fusion 
constructs and successful expression of the fusion protein, as well as binding of MSA and Myc 
antibodies to MSA-Myc fusion proteins depending on the location of the heterologous sequence 
in the MSA protein. 

Figure 3 depicts inhibition of FGF-induced proliferation of bovine capillary endothelial 
cells by RGD peptide and by MSA-myc-RGD fusion proteins. 

Figures 4A-I highlight loops of serum albumen which may be replaced with display 
therapeutic polypeptide sequences as described below. 

Figure 5 illustrates amino acid sequences for the display of therapeutic polypeptide 
sequences in the Cys 53 -Cys 62 loop of moiwe serum albumen. 

Figure 6 depicts the inhibitory effects of mouse serum albumen proteins as seybrthm 

Figure 5 b¥bovinecapiTlary endothelial (BCE) cells stimulated by FGF. 

Figure 7 illustrates the inhibitory effects of mouse serum albumen proteins as set forth in 
Figure 5 on human umbilical vein endothelial cells (HUVECs) stimulated by FGF. 

Figure 8 shows the induction of apoptosis induced by MSA-RGD fusion protein in NCI 
1869 human non-small cell lung carcinoma cell line. 

Detailed Description of the Invention 

The systems and methods disclosed herein are directed towards increasing the lifetime of 
therapeutic polypeptides in the bloodstream by creating chimeric polypeptides containing 
segments of serum albumin (SA) and segments of biologically active heterologous peptide 



sequences. SA is the major protein constituent of the circulatory system, has a half-life in the 
blood of about three weeks (Rothschild, M.A. et al. Hepatology 1988, 8, 385-401), and is present 
in quantity (40 g/L in the serum). It is also known that the normal adult human liver produces 
approximately 15 grams of human serum albumin (HSA) per day, or about 200 mg per kilogram 
of body weight. Serum albumin has no immunological activity or enzymatic function, and is a 
natural carrier protein used to transport many natural and therapeutic molecules. Fusion proteins 
wherein a therapeutic polypeptide has been covalently linked to serum albumin have been shown 
to have serum half-lives many times longer than the half-life of the therapeutic peptide itself 
(Syed, S. et al. Blood 1997, 89, 3243-3252; Yeh, P. et al. Proc. Natl Acad. Set USA 1992, 89, 
1904-1908). In both cited publications, the half-life of the fusion protein was more than 140 times 
greater than that of the therapeutic polypeptide itself, and approached the half-life of unfused 
serum albumin. Furthermore, the amino-terminal portion of serum albumin has been found to 
favor particularly efficient translocation and export of the fusion proteins in eukaryotic cells (PCT 
publication WO 90/13653). Generally, this means that such proteins are more efficiently secreted 
by a cell manufacturing such proteins than are the free therapeutic polypeptides themselves. 

From a drug delivery standpoint, chimeric polypeptides of serum albumin proteins offer 
substantial promise because serum albumins are found in tissues and secretions throughout the 
body. It is known, for example, that serum albumin is responsible for the transport of compounds 

across organ-circulatory interfaces into such organs as the liver, intestine, kidney, and brain 

"Chimeric pfdteinsof serum albumin may thus manifest their biological activity anywhere in the 
body, crossing even the daunting blood-brain barrier. 

The three-dimensional structure and the chemistry of SA have been well studied (Carter, 
D.C. et al. Eur. J. Biochem. 1994, 226, 1049-1052; He, X.M. et al. Nature 1992, 358, 209-215; 
Carter, D.C. et al. Science 1989, 244, 1 195-1 198). Thus, rather than relying on simple, binary 
fusion proteins as discussed above, portions of the SA protein may be strategically or 
combinatorially replaced by therapeutic polypeptides. For example, cysteine-constrained loops 
may be selected for replacement, e.g., on the presumption that structural changes to the loop are 
likely to minimally affect the tertiary structure of the protein as a whole. Figures 4A-I show the 
locations of several such loops on the mouse serum albumen protein. Effective replacement and 



insertion into such loops is demonstrated in the Examples below. The present invention 
contemplates insertion into or replacement of any one of the loops depicted in Figures 4A-I, or 
any combination of such loops. In certain embodiments, a loop selected for insertion or 
replacement is located at or near the surface of the serum albumen protein to facilitate 
intermolecular interactions. One of skill in the art will readily be able to adapt these techniques to 
other serum albumen proteins, e.g., bovine, human, and other serum albumen proteins. 

Techniques of combinatorial mutagenesis combined with structurally motivated grafting 
procedures allow the random preparation of a library of many related polypeptides which carry a 
biologically active peptide fragment and are substantially similar to serum albumin in tertiary 
structure. For example, a chimeric polypeptide of the present invention may include a biologically 
active heterologous peptide sequence inserted into the peptide sequence of a serum albumin 
protein. The inserted sequence may optionally replace a portion of the serum albumin sequence, 
whether that portion is of similar or dissimilar length. In some cases, more than one insertion may 
be required to obtain the desired biological activity. Alternatively, a biologically active 
heterologous peptide sequence may be placed between two fragments of a serum albumin 
sequence to create such a chimeric polypeptide. Optionally, one or more additional biologically 
active peptide sequences may be placed between fragments of serum albumin protein. Chimeric 
polypeptides of the present invention may also be described as a biologically active heterologous 
peptide sequence flanked on one side by an N-terminal fragment of serum afcumin ^protein _andon 
the other side~by a C-terminal fragment of serum albumin protein. 

The advantage of such chimeric polypeptides is that the similarity to serum albumin 
protein in structure may camouflage these polypeptides to biological mechanisms which degrade 
foreign peptides even more effectively than known fusion proteins, because the foreign 
polypeptide fragments are carried on a protein that is substantially similar to a protein that is 
pervasive within the organism. Such proteins may retain the beneficial characteristics of serum 
albumin (non-immunogenicity, high level of expression, efficient secretion, and long half-life), 
while supporting the additional desired biological function. 

Many therapeutic applications of such chimeric polypeptides will be obvious to those 
skilled in the art. For example, inclusion of a peptide fragment which inhibits cell proliferation 



might serve as a treatment for cancer and other diseases characterized by cell proliferation known 
to those in the art. Inclusion of a peptide fragment which modulates the differentiation of 
immature cells into particular cell types may create a chimeric polypeptide which may be effective 
in the treatment of neurological conditions, e.g., nerve damage and neurodegenerative diseases, 
hyperplastic and neoplastic disorders of pancreatic tissue, and other conditions characterized by 
undesirable proliferation and differentiation of tissue. Inclusion of a peptide fragment which 
induces apoptosis may provide a polypeptide effective in treating diseases marked by unwanted 
cell proliferation, such as cancer, and other conditions known to those in the art as amenable to 
apoptotic therapy. Inclusion of an anti-angiogenic peptide fragment, e.g., a fragment of angiostatin 
or endostatin, may yield a chimeric polypeptide useful in the treatment of cancer and other 
conditions resulting from or enabled by angiogenesis. 

Definitions 

The term 'peptide 1 refers to an oligomer in which the monomers are amino acids (usually 
alpha-amino acids) joined together through amide bonds. Peptides are two or more amino acid 
monomers long, but more often are between 5 to 10 amino acid monomers long and can be even 
longer, i.e., up to 20 amino acids or more, although peptides longer than 20 amino acids are more 
likely to be called 'polypeptides 1 . The term 'protein' is well known in the art and usually refers to a 
very large polypeptide, or set of associated homologous or heterologousj^lypeptides, Jhat has some_ 
biological function. For purposes of the present invention the terms 'peptide', 'polypeptide', and 
'protein 1 are largely interchangeable as all three types are collectively referred to as peptides. 

The interchangeable terms 'fusion 1 and 'chimeric 1 , as used herein to describe proteins and 
polypeptides, relate to polypeptides or proteins wherein two individual polypeptides or portions 
thereof are fused to form a single amino acid chain. Such fusion may arise from the expression of a 
single continuous coding sequence formed by recombinant DNA techniques. Thus, 'fusion' 
polypeptides and 'chimeric' polypeptides include contiguous polypeptides comprising a first 
polypeptide covalently linked via an amide bond to one or more amino acid sequences which define 
polypeptide domains that are foreign to and not substantially homologous with any domain of the first 
polypeptide. 



Gene constructs encoding fusion proteins are likewise referred to a Chimeric genes 1 or 'fusion 

genes 1 . 

'Homology 1 and 'identity 1 each refer to sequence similarity between two polypeptide 
sequences, with identity being a more strict comparison. Homology and identity can each be 
determined by comparing a position in each sequence which may be aligned for purposes of 
comparison. When a position in the compared sequence is occupied by the same amino acid residue, 
then the polypeptides can be referred to as identical at that position; when the equivalent site is 
occupied by the same amino acid (e.g., identical) or a similar amino acid (e.g., similar in steric and/or 
electronic nature), then the molecules can be referred to as homologous at that position. A percentage 
of homology or identity between sequences is a function of the number of matching or homologous 
positions shared by the sequences. An 'unrelated', 'heterologous', or 'non-homologous' sequence 
shares less than 40 percent identity, though preferably less than 25 percent identity, with a sequence 
to which it is compared. Thus, a 'heterologous peptide sequence' is a peptide sequence substantially 
dissimilar to a sequence to which it is compared. 

The term 'serum albumin' (SA) is intended to include (but not necessarily to be restricted to) 
serum albumin proteins of living organisms, preferably mammalian serum albumins, even more 
preferably known or yet-to-be-discovered polymorphic forms of human serum albumin (HSA), and 
variants thereof For example, the human serum albumin Naskapi has Lys-372 in place of Glu-372, 
and albumin Christchurch has an altered pro-sequence. The term 'variants' intended toJnclude_(but_ 
"not" necessarily be restricted to) homologs of S A proteins with minor artificial variations in sequence 
(such as molecules lacking one or a few residues, having conservative substitutions or minor 
insertions of residues, or having minor variations of amino acid structure). Thus, polypeptides which 
have 80%, 85%, 90%, or 99% homology with a native SA are deemed to be 'variants'. It is also 
preferred for such variants to share at least one pharmacological utility with a native SA. Any putative 
variant which is to be used pharmacologically should be non-immunogenic in the animal (especially 
human) being treated. Sequences of a number of contemplated serum albumin proteins can be 
obtained from GenBank (National Center for Biotechnology Information), including human, bovine, 
mouse, pig, horse, sheep, and chick serum albumins. 



The term 'native* is used to describe a protein which occurs naturally in a living organism. 
Wild-type proteins are thus native proteins. Proteins which are non-native are those which have been 
generated by artificial mutation, recombinant design, or other laboratory modification and are not 
known in natural populations. 

'Conservative substitutions* are those where one or more amino acids are substituted for others 
having similar properties such that one skilled in the art of polypeptide chemistry would expect at 
least the secondary structure, and preferably the tertiary structure, of the polypeptide to be 
substantially unchanged. For example, typical such substitutions include asparagine for glutamine, 
serine for asparagine, and arginine for lysine. The term 'physiologically functional equivalents' also 
encompasses larger molecules comprising the native sequence plus a further sequence at the N- 
terminus (for example, pro-HSA, pre-pre-HS A, and met-HSA). 

Tertiary structure 1 refers to the three-dimensional structure of a protein. Proteins which have 
similar tertiary structures will have similar shapes and surfaces, even if the amino acid sequences (the 
'secondary structure') is not identical. Tertiary structure is a consequence of the folding and twisting 
of an amino acid chain upon itself and can be disrupted by chemical means, e.g., strong acid or base, 
or by physical means, e.g., heating. 

The term 'biologically active' refers to an entity which interacts in some way with a living 
organism on a molecular level. Entities which are biologically active may activate a receptor, provoke 
an immune reaction, interact with a membrane oj^icm^hzuiiie^ 
biological function of an organism or any part of an organism. 

The term 'ligand' refers to a molecule that is recognized by a particular protein, e.g., a 
receptor. Any agent bound by or reacting with a protein is called a 'ligand', so the term encompasses 
the substrate of an enzyme and the reactants of a catalyzed reaction. The term 'ligand 1 does not imply 
any particular molecular size or other structural or compositional feature other than that the substance 
in question is capable of binding or otherwise interacting with a protein. A 'ligand' may serve either 
as the natural ligand to which the protein binds or as a functional analogue that may act as an agonist 
or antagonist. 

The term 'vector' refers to a DNA molecule, capable of replication in a host cell, into which 
a gene can be inserted to construct a recombinant DNA molecule. Examples of vectors include 
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plasmids and infective microorganisms such as viruses, or non-viral vectors such as ligand-DNA 
conjugates, liposomes, or lipid-DNA complexes. 

As used herein, 'cell surface receptor 1 refers to molecules that occur on the surface of cells, 
interact with the extracellular environment, and (directly or indirectly) transmit or transduce the 
information regarding the environment intracellular^ in a manner that may modulate intracellular 
second messenger activities or transcription of specific promoters, resulting in transcription of 
specific genes. 

As used herein, 'extracellular signals 1 include a molecule or other change in the extracellular 
environment that is transduced intracellularly via cell surface proteins that interact, directly or 
indirectly, with the signal. An extracellular signal or effector molecule includes any compound or 
substance that in some manner alters the activity of a cell surface protein. Examples of such signals 
include, but are not limited to, molecules such as acetylcholine, growth factors and hormones, lipids, 
sugars and nucleotides that bind to cell surface and/or intracellular receptors and ion channels and 
modulate the activity of such receptors and channels. 

As used herein, Extracellular signals 1 also include as yet unidentified substances that modulate 
the activity of a cellular receptor, and thereby influence intracellular functions. Such extracellular 
signals are potential pharmacological agents that may be used to treat specific diseases by modulating 
the activity of specific cell surface receptors. 

'Orphan receptors' is a designatio^giyejU^^ 
been described and/or for which no function has been determined. 

The term 'target cells 1 as used herein means cells, either in vivo or ex vivo, into which it is 
desired to introduce exogenous genetic material. Target cells may be any type of cell, including blood 
cells, skeletal muscle cells, stem cells, skin cells, liver cells, secretory gland cells, hematopoietic cells, 
and marrow cells. 

An 'effective amount' of a fusion polypeptide, with respect to the subject method of treatment, 
refers to an amount of the polypeptide in a preparation which, when applied as part of a desired 
dosage regimen, provides inhibition of angiogenesis so as to reduce or cure a disorder according to 
clinically acceptable standards. 



'Serum half-life 1 as used herein refers to the time required for half of a quantity of a peptide 
in the bloodstream to be degraded. 

Exemplification 

As set out above, the chimeric polypeptide of the present invention can be constructed as a 
chimeric polypeptide containing a sequence homologous to at least a portion of a serum albumin and 
at least a portion of one or more heterologous proteins, expressed as one contiguous polypeptide 
chain. In preparing the chimeric polypeptide, a fusion gene is constructed comprising DNA encoding 
at least one sequence each of a serum albumin, a heterologous protein, and, optionally, a peptide 
linker sequence to span the fragments. If more than one heterologous sequences are included in the 
chimeric polypeptide, they may be identical, related, or unrelated sequences. Identical sequences may 
be included to increase the effective concentration of the sequence. Related sequences may be 
included to more accurately mimic the native protein from which they are derived. Unrelated 
sequences may be useful for activating two or more distinct receptors that stimulate the same 
response, or for imparting two or more distinct activities to the chimeric polypeptide. For example, 
the chimeric polypeptide might include a sequence that has antiangiogenic activity and a sequence 
which induces apoptosis of tumor cells. 

To make this chimeric polypeptide, an entire protein can be cloned and expressed as part of 
the protein, or alternatively, a suitable fragment thereof containing jjbiologically active. moiety can_- 
be used. The use of recombinant DNA techniques to create a fusion gene, with the translational 
product being the desired chimeric polypeptide, is well known in the art. Both the coding sequence 
of a gene and its regulatory regions can be redesigned to change the functional properties of the 
protein product, the amount of protein made, or the cell type in which the protein is produced. The 
coding sequence of a gene can be extensively altered, for example, by fusing part of it to the coding 
sequence of a different gene to produce a novel hybrid gene that encodes a fusion protein. Examples 
of methods for producing fusion proteins are described in PCT applications PCT/US87/02968, 
PCT/US89/03587 and PCT/US90/07335, as well as Traunecker et al. (1989) Nature 339:68, all of 
which are incorporated by reference herein. 
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Techniques for making fusion genes are well known. Essentially, the joining of various DNA 
fragments coding for different polypeptide sequences is performed in accordance with conventional 
techniques, employing blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion 
to provide for appropriate termini, filling in of cohesive ends as appropriate, alkaline phosphatase 
treatment to avoid undesirable joining, and enzymatic ligation. Alternatively, the fusion gene can be 
synthesized by conventional techniques including automated DNA synthesizers. In another method, 
PCR amplification of gene fragments can be carried out using anchor primers which give rise to 
complementary overhangs between two consecutive gene fragments which can subsequently be 
annealed to generate a chimeric gene sequence (see, for example, Current Protocols in Molecular 
Biology, Eds. Ausubel et al. John Wiley & Sons: 1992). 

This invention also provides expression vectors comprising a nucleotide sequence encoding 
a subject chimeric polypeptide operably linked to at least one regulatory sequence. 'Operably linked 1 
is intended to mean that the nucleotide sequence is linked to a regulatory sequence in a manner which 
allows expression of the nucleotide sequence. Regulatory sequences are art-recognized and are 
selected to direct expression of the encoded polypeptide. Accordingly, the term regulatory sequence 
includes promoters, enhancers and other expression control elements. Exemplary regulatory 
sequences are described in Goeddel; Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990). For instance, any of a wide variety of expression control 

sequences-sequences that control the expression of a DNA sequence when_op^atively linkeitQ it 

may be used in these vectors to express DNA sequences encoding the chimeric polypeptides of this 
invention. Such useful expression control sequences, include, for example, the early and late 
promoters of SV40, adenovirus or cytomegalovirus immediate early promoter, the lac system, the trp 
system, the TAC or TRC system, T7 promoter whose expression is directed by T7 RNA polymerase, 
the major operator and promoter regions of phage lambda, the control regions for fd coat protein, the 
promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid 
phosphatase, e.g., Pho5, the promoters of the yeast ex-mating factors, the polyhedron promoter of the 
baculovirus system and other sequences known to control the expression of genes of prokaryotic or 
eukaryotic cells or their viruses, and various combinations thereof. It should be understood that the 
design of the expression vector may depend on such factors as the choice of the host cell to be 
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transformed and/or the type of protein desired to be expressed. Moreover, the vector's copy number, 
the ability to control that copy number and the expression of any other proteins encoded by the vector, 
such as antibiotic markers, should also be considered. 

As will be apparent, the subject gene constructs can be used to cause expression of the subject 
chimeric polypeptides in cells propagated in culture, e.g., to produce chimeric polypeptides, for 
purification. This represents a method for preparing substantial quantities of the polypeptide, e.g., 
for research, clinical, and pharmaceutical uses. 

In certain therapeutic applications, the ex v/w-derived chimeric polypeptides are utilized in 
a manner appropriate for therapy in general. For such therapy, the polypeptides of the invention can 
be formulated for a variety of modes of administration, including systemic and topical or localized 
administration. In such embodiments, the polypeptide may by combined with a pharmaceutical^ 
acceptable excipient, e.g., a non-pyrogenic excipient. Techniques and formulations generally may be 
found in Remmington's Pharmaceutical Sciences . Meade Publishing Co., Easton, PA. For systemic 
administration, injection being preferred, including intramuscular, intravenous, intraperitoneal, and 
subcutaneous injection, the polypeptides of the invention can be formulated in liquid solutions, 
preferably in physiologically compatible buffers such as Hank's solution or Ringer's solution. In 
addition, the polypeptides may be formulated in solid form and redissolved or suspended immediately 
prior to use. Lyophilized forms are also included. 

Systemic administration can also be by transmucosal or transdermal means, or the compounds 
can be administered orally. For transmucosal or transdermal administration, penetrants appropriate 
to the barrier to be permeated are used in the formulation. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration bile salts and fusidic acid 
derivatives. In addition, detergents may be used to facilitate permeation. Transmucosal 
administration may be through nasal sprays or using suppositories. For oral administration, the 
peptides are formulated into conventional oral administration forms such as capsules, tablets, and 
tonics. For topical administration, particularly cosmetic formulations, the oligomers of the invention 
are formulated into ointments, salves, gels, or creams as generally known in the art. 

Alternative means of administration of peptides have been developed. Sustained-release 
formulations (Putney, et al. Nature Biotechnology 1998, 16, 153-157) are advantageous, requiring 
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fewer administrations and, often, lower dosages. Techniques for oral delivery of peptides have been 
reviewed (Fasano, A. Trends in Biotechnology 1998, 16, 152-157), as have several site-specific 
means of peptide delivery (Pettit, D.K. et al. Trends in Biotechnology 1998, 76, 343-349). Additional 
techniques for therapeutic administration of peptides are known to those of skill in the art. 

Genetic material of the present invention can be delivered, for example, as an expression 
plasmid which, when transcribed in the cell, produces the desired chimeric polypeptide. 

In another embodiment, the genetic material is provided by use of an "expression" construct, 
which can be transcribed in a cell to produce the chimeric polypeptide. Such expression constructs 
may be administered in any biologically effective carrier, e.g., any formulation or composition 
capable of effectively transfecting cells either ex vivo or in vivo with genetic material encoding a 
chimeric polypeptide. Approaches include insertion of the antisense nucleic acid in viral vectors 
including recombinant retroviruses, adenoviruses, adeno-associated viruses, human 
immunodeficiency viruses, and herpes simplex viruses- 1, or recombinant bacterial or eukaryotic 
plasmids. Viral vectors can be used to transfect cells directly; plasmid DNA can be delivered with 
the help of, for example, cationic liposomes (lipofectin) or derivatized (e.g., antibody conjugated), 
polylysine conjugates, gramacidin S, artificial viral envelopes or other such intracellular carriers, as 
well as direct injection of the gene construct or calcium phosphate precipitation carried out in vivo. 
It will be appreciated that because transduction of appropriate target cells represents the critical first 
step in gene therapy, choice of the particular gene delivery system will depend on such factors as the 
"phenotype of the intended target and the route of administration, e.g., locally or systemically. 

A preferred approach for in vivo introduction of genetic material encoding one of the subject 
proteins into a cell is by use of a viral vector containing said genetic material. Infection of cells with 
a viral vector has the advantage that a large proportion of the targeted cells can receive the nucleic 
acid. Additionally, chimeric polypeptides encoded by genetic material in the viral vector, e.g., by a 
nucleic acid contained in the viral vector, are expressed efficiently in cells which have taken up viral 
vector nucleic acid. Such a strategy may be particularly effective when skeletal muscle cells are the 
targets of the vector (Fisher, KJ. et al. Nature Medicine 1997, J, 306-3 12). 

Retrovirus vectors and adeno-associated virus vectors are generally understood to be the 
recombinant gene delivery system of choice for the transfer of exogenous genes in vivo, particularly 



-15- 



into humans. These vectors provide efficient delivery of genes into cells, and the transferred nucleic 
acids are stably integrated into the chromosomal DNA of the host. A major prerequisite for the use 
of retroviruses is to ensure the safety of their use, particularly with regard to the possibility of the 
spread of wild-type virus in the cell population. The development of specialized cell lines (termed 
"packaging cells") which produce only replication-defective retroviruses has increased the utility of 
retroviruses for gene therapy, and defective retroviruses are well characterized for use in gene transfer 
for gene therapy purposes (for a review see Miller, A.D. (1990) Blood 76:271). Thus, recombinant 
retrovirus can be constructed in which part of the retroviral coding sequence (gag, pol, env) has been 
replaced by nucleic acid encoding one of the antisense E6AP constructs, rendering the retrovirus 
replication defective. The replication defective retrovirus is then packaged into virions which can be 
used to infect a target cell through the use of a helper virus by standard techniques. Protocols for 
producing recombinant retroviruses and for infecting cells in vitro or in vivo with such viruses can 
be found in Current Protocols in Molecular Biology . Ausubel, F.M. et al. (eds.) Greene Publishing 
Associates, (1989), Sections 9.10-9.14, and other standard laboratory manuals. Examples of suitable 
retroviruses include pLJ, pZIP, pWE and pEM which are well known to those skilled in the art. 
Examples of suitable packaging virus lines for preparing both ecotropic and amphotropic retroviral 
systems include \|/Crip, v|/Cre, \|/2 and \|/Am. Retroviruses have been used to introduce a variety of 
genes into many different cell types, including neural cells, epithelial cells, endothelial cells, 
lymphocytes, myoblasts, hepatocytes, bone marrow cells, in vitro and/or in vivo (^Jb^exampje_ 
Eglitisret al: ^ Danos and Mulligan (1988) Proa Natl Acad, ScL USA 

85:6460-6464; Wilson et al. (1988) Proc. Natl Acad, Set USA 85:3014-3018; Armentano et al. 
(1990) Proc, Natl Acad, Sci, USA 87:6141-6145; Huber et al. (1991) Proc, Natl Acad Set USA 
88:8039-8043; Ferry et al. (1991) Proc, Natl, Acad ScL USA 88:8377-8381; Chowdhury et al. (1991) 
Science 254:1802-1805; van Beusechem et al. (1992) Proc, Natl, Acad Sci. USA 89:7640-7644; Kay 
et al. (1992) Human Gene Therapy 3:641-647; Dai et al. (1992) Proc, Natl Acad, Sci, USA 89:10892- 
10895; Hwu et al. (1993)7. Immunol, 150:4104-4115; U.S. Patent No. 4,868,116; U.S. Patent No. 
4,980,286; PCT Application WO 89/07136; PCT Application WO 89/02468; PCT Application WO 
89/05345; and PCT Application WO 92/07573). 
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In choosing retroviral vectors as a gene delivery system for genetic material encoding the 
subject chimeric polypeptides, it is important to note that a prerequisite for the successful infection 
of target cells by most retroviruses, and therefore of stable introduction of the genetic material, is that 
the target cells must be dividing. In general, this requirement will not be a hindrance to use of 
retroviral vectors. In fact, such limitation on infection can be beneficial in circumstances wherein the 
tissue (e.g., nontransformed cells) surrounding the target cells does not undergo extensive cell 
division and is therefore refractory to infection with retroviral vectors. 

Furthermore, it has been shown that it is possible to limit the infection spectrum of 
retroviruses and consequently of retro viral-based vectors, by modifying the viral packaging proteins 
on the surface of the viral particle (see, for example, PCT publications W093/25234, WO94/06920, 
and W094/11524). For instance, strategies for the modification of the infection spectrum of 
retroviral vectors include: coupling antibodies specific for cell surface antigens to the viral env protein 
(Roux et al. (1989) PNAS 86:9079-9083; Julan et al. (1992) J. Gen Virol 73:3251-3255; and Goud 
et al. (1983) Virology 163:251-254); or coupling cell surface ligands to the viral env proteins (Neda 
et al. (1991) J Biol Chem 266:14143-14146). Coupling can be in the form of the chemical cross- 
linking with a protein or other variety (e.g., lactose to convert the env protein to an 
asialoglycoprotein), as well as by generating chimeric proteins (e.g., single-chain antibody/ewv 
chimeric proteins). This technique, while useful to limit or otherwise direct the infection to certain 

tissue types, and can also be used to convert an ecotropic vector in to an amphotropic vector. 

Moredver, - us€Tof retroviral gene delivery can be further enhanced by the use of tissue- or cell- 
specific transcriptional regulatory sequences which control expression of the genetic material of the 
retroviral vector. 

Another viral gene delivery system useful in the present invention utilizes adenovirus-derived 
vectors. The genome of an adenovirus can be manipulated such that it encodes a gene product of 
interest, but is inactive in terms of its ability to replicate in a normal lytic viral life cycle (see, for 
example, Berkner et al. (1988) BioTechniques 6:616; Rosenfeld et al. (1991) Science 252:431-434; 
and Rosenfeld et al. (1992) Cell 68:143-155). Suitable adenoviral vectors derived from the 
adenovirus strain Ad type 5 dl324 or other strains of adenovirus (e.g., Ad2, Ad3, Ad7, etc.) are well 
known to those skilled in the art. Recombinant adenoviruses can be advantageous in certain 
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circumstances in that they are capable of infecting non-dividing cells and can be used to infect a wide 
variety of cell types, including airway epithelium (Rosenfeld et al (1992) cited supra), endothelial 
cells (Lemarchand et al. (1992) Proc. Natl Acad. Set USA 89:6482-6486), hepatocytes (Herz and 
Gerard (1993) Proc. Natl Acad. Set USA 90:2812-2816) and muscle cells (Quantin et al. (1992) 
Proc. Natl Acad. Sci. USA 89:2581-2584). Furthermore, the virus particle is relatively stable and 
amenable to purification and concentration, and, as above, can be modified so as to affect the 
spectrum of infectivity. Additionally, introduced adenoviral DNA (and foreign DNA contained 
therein) is not integrated into the genome of a host cell but remains episomal, thereby avoiding 
potential problems that can occur as a result of insertional mutagenesis in situations where introduced 
DNA becomes integrated into the host genome (e.g., retroviral DNA). Moreover, the carrying 
capacity of the adenoviral genome for foreign DNA is large (up to 8 kilobases) relative to other gene 
delivery vectors (Berkner et al., supra; Haj-Ahmand and Graham (1986) J. Virol 57:267). Most 
replication-defective adenoviral vectors currently in use and therefore favored by the present 
invention are deleted for all or parts of the viral El and E3 genes but retain as much as 80% of the 
adenoviral genetic material (see, for example, Jones et al. (1979) Cell 16:683; Berkner et al., supra; 
and Graham et al. in Methods in Molecular Biology . EJ. Murray, Ed. (Humana, Clifton, NJ, 1991) 
vol. 7. pp. 109-127). Expression of the inserted genetic material can be under control of, for example, 
the El A promoter, the major late promoter (MLP) and associated leader sequences, the E3 promoter, 

or exogenously added promoter sequences. 

Yet Mother viral vector system useful for delivery of genetic material encoding the subject 

chimeric polypeptides is the adeno-associated virus (AAV). Adeno-associated virus is a naturally 
occurring defective virus that requires another virus, such as an adenovirus or a herpes virus, as a 
helper virus for efficient replication and a productive life cycle. (For a review see Muzyczka et al. 
Curr. Topics in Micro, and Immunol. (1992) 158:97-129). It is also one of the few viruses that may 
integrate its DNA into non-dividing cells, and exhibits a high frequency of stable integration (see for 
example Flotte et al. (1992) Am. J. Respir. Cell. Mol Biol 7:349-356; Samulski et al. (1989) J. Virol. 
63:3822-3828; and McLaughlin et al. (1989) J. Virol 62:1963-1973). Vectors containing as little as 
300 base pairs of AAV can be packaged and can integrate. Space for exogenous DNA is limited to 
about 4.5 kb. An AAV vector such as that described in Tratschin et al. (1985) Mol Cell. Biol 
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5:3251-3260 can be used to introduce DNA into cells. A variety of nucleic acids have been 
introduced into different cell types using AAV vectors (see for example Hermonat et al. (1984) Proc. 
Natl Acad. Set USA 81:6466-6470; Tratschin et al. (1985) Mol Cell Biol 4:2072-2081; Wondisford 
et al. (1988) Mol Endocrinol 2:32-39; Tratschin et al. (1984) 1 Virol 51:611-619; and Flotte et al. 
(1993) J. Biol Chem. 268:3781-3790). 

Other viral vector systems that may have application in gene therapy have been derived from 
herpes virus, vaccinia virus, and several RNA viruses. 

In addition to viral transfer methods, such as those illustrated above, non-viral methods can 
also be employed to cause expression of genetic material encoding the subject chimeric polypeptides 
in the tissue of an animal. Most nonviral methods of gene transfer rely on normal mechanisms used 
by mammalian cells for the uptake and intracellular transport of macromolecules. In preferred 
embodiments, non-viral gene delivery systems of the present invention rely on endocytic pathways 
for the uptake of genetic material by the targeted cell. Exemplary gene delivery systems of this type 
include liposomal derived systems, polylysine conjugates, and artificial viral envelopes. 

In a representative embodiment, genetic material can be entrapped in liposomes bearing 
positive charges on their surface (e.g., lipofectins) and, optionally, which are tagged with antibodies 
against cell surface antigens of the target tissue (Mizuno et al. (1992) No Shinkei Geka 20:547-551; 
PCT publication WO91/06309; Japanese patent application 1047381; and European patent 
publication EP-A-43075). For example, lipofection of papilloma-infected cells can be _carried out 
using'lipb^omes tagged with monoclonal antibodies against PV-associated antigen (see Viae et al. 
(1978) J Invest Dermatol 70:263-266; see also Mizuno et al. (1992) Neurol Med. Chir. 32:873-876). 

In yet another illustrative embodiment, the gene delivery system comprises an antibody or cell 
surface ligand which is cross-linked with a gene binding agent such as polylysine (see, for example, 
PCT publications WO93/04701, W092/22635, WO92/20316, W092/19749, and WO92/06180). For 
example, genetic material encoding the subject chimeric polypeptides can be used to transfect 
hepatocytic cells in vivo using a soluble polynucleotide carrier comprising an asialoglycoprotein 
conjugated to a polycation, e.g., polylysine (see U.S. Patent 5,166,320). It will also be appreciated 
that effective delivery of the subject nucleic acid constructs via mediated endocytosis can be 
improved using agents which enhance escape of the gene from the endosomal structures. For instance, 
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whole adenovirus or fiisogenic peptides of the influenza HA gene product can be used as part of the 
delivery system to induce efficient disruption of DNA-containing endosomes (Mulligan et al. (1993) 
Science 260-926; Wagner et al. (1992) PNAS 89:7934; and Christiano et al. (1993) PNAS 90:2122). 

In clinical settings, the gene delivery systems can be introduced into a patient by any of a 
number of methods, each of which is familiar in the art. For instance, a pharmaceutical preparation 
of the gene delivery system can be introduced systemically, e.g., by intravenous injection, and specific 
transduction of the target cells occurs predominantly from specificity of transfection provided by the 
gene delivery vehicle, cell-type or tissue-type expression due to the transcriptional regulatory 
sequences controlling expression of the gene, or a combination thereof. In other embodiments, initial 
delivery of the recombinant gene is more limited with introduction into the animal being quite 
localized. For example, the gene delivery vehicle can be introduced by catheter (see U.S. Patent 
5,328,470) or by stereotactic injection (e.g., Chen et al. (1994) PNAS 91 : 3054-3057). 

Moreover, the pharmaceutical preparation can consist essentially of the gene delivery system 
in an acceptable diluent, or can comprise a slow release matrix in which the gene delivery vehicle is 
imbedded. Alternatively, where the complete gene delivery system can be produced intact from 
recombinant cells, e.g., retroviral packages, the pharmaceutical preparation can comprise one or more 
cells which produce the gene delivery system. In the latter case, methods of introducing the viral 
packaging cells may be provided by, for example, rechargeable or biodegradable devices. Various 
slow release polymeric devices have been developed and tested in vivo in recent years^ for^ie 
controlled delivery of drugs, including proteinaceous biopharmaceuticals, and can be adapted for 
release of viral particles through the manipulation of the polymer composition and form. A variety 
of biocompatible polymers (including hydrogels), including both biodegradable and non-degradable 
polymers, can be used to form an implant for the sustained release of an the viral particles by cells 
implanted at a particular target site. Such embodiments of the present invention can be used for the 
delivery of an exogenously purified virus, which has been incorporated in the polymeric device, or 
for the delivery of viral particles produced by a cell encapsulated in the polymeric device. 

By choice of monomer composition or polymerization technique, the amount of water, 
porosity and consequent permeability characteristics can be controlled. The selection of the shape, 
size, polymer, and method for implantation can be determined on an individual basis according to the 



-20- 



disorder to be treated and the individual patient response. The generation of such implants is generally 
known in the art. See, for example, Concise Encyclopedia of Medical & Dental Materials, ed. by 
David Williams (MIT Press: Cambridge, MA, 1990); and the Sabel et al. U.S. Patent No. 4,883,666. 
In another embodiment of an implant, a source of cells producing a the recombinant virus is 
encapsulated in implantable hollow fibers. Such fibers can be pre-spun and subsequently loaded with 
the viral source (Aebischer et al. U.S. Patent No. 4,892,538; Aebischer et al. U.S. Patent No. 
5,106,627; Hoffinan et al. (1990) Expt Neurobiol 1 10:39-44; Jaeger et al. (1990) Prog. Brain Res. 
82:41-46; and Aebischer et al. (1991) J. Biomech. Eng. 1 13:178-183), or can be co-extruded with a 
polymer which acts to form a polymeric coat about the viral packaging cells (Lim U.S. Patent No. 
4,391,909; Sefton U.S. Patent No. 4,353,888; Sugamori et al. (1989) Trans. Am. Artif. Intern. Organs 
35:791-799; Sefton et al. (1987) Biotechnol. Bioeng. 29:1135-1143; and Aebischer et al. (1991) 
Biomaterials 12:50-55). Again, manipulation of the polymer can be carried out to provide for optimal 
release of viral particles. 

Chimeric polypeptides of the present invention can be designed by using molecular modeling. 
A computer model of serum albumin may be altered to include a selected heterologous sequence and 
the resulting structure may be submitted to calculations designed to determine how the resulting 
peptide will change in shape, how much strain the alteration introduces into the polypeptide, how the 
heterologous sequence is displayed in three dimensions, and other data relevant to the resulting 
structure of the chimeric polypeptide. Alternatively, the nature of the sequencejtobe ii^^dedjnighL 
belietermined by the calculation, based on knowledge of a receptor or binding pocket. In another 
embodiment, the calculations might best determine how to insert a desired sequence to maintain the 
tertiary structure of the serum albumin backbone, or to display the insertion in the proper orientation. 
Other calculational strategies will be known to those skilled in the art. Calculations such as these can 
be useful for directing the synthesis of chimeric polypeptides of the present invention in a time- and 
material-efficient manner, before actual synthesis and screening techniques begin. 

Methods for screening chimeric polypeptides of the present invention are well known in the 
art, independent of the use of computer modeling. The use of peptide libraries is one way of screening 
large numbers of polypeptides at once. In one screening assay, the candidate peptides are displayed 
on the surface of a cell or viral particle, and the ability of particular cells or viral particles to bind a 



-21- 



target molecule, such as a receptor protein via this gene product is detected in a "panning assay". For 
instance, the gene library can be cloned into the gene for a surface membrane protein of a bacterial 
cell, and the resulting chimeric polypeptide detected by panning (Ladner et al., WO 88/06630; Fuchs 
et al. (1991) Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS 18:136-140). 

In an alternate embodiment, the peptide library is expressed as chimeric polypeptides on the 
surface of a viral particle. For instance, in the filamentous phage system, foreign peptide sequences 
can be expressed on the surface of infectious phage, thereby conferring two significant benefits. First, 
since these phage can be applied to affinity matrices at very high concentrations, a large number of 
phage can be screened at one time. Second, since each infectious phage displays the combinatorial 
gene product on its surface, if a particular phage is recovered from an affinity matrix in low yield, the 
phage can be amplified by another round of infection. The group of almost identical E. coli 
filamentous phages Ml 3, fd, and fl are most often used in phage display libraries, as either of the 
phage gin or gVIII coat proteins can be used to generate chimeric polypeptides without disrupting the 
ultimate packaging of the viral particle (Ladner et al. PCT publication WO 90/02809; Garrard et al., 
PCT publication WO 92/09690; Marks et al. (1992) J. Biol Chem. 267:16007-16010; Griffiths et al. 
(1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) 
PNAS 89:4457-4461). 

The field of combinatorial peptide libraries has been reviewed (Gallop et al. J. Med. Chem. 
1994, 37, 1233-1251), and additional techniques are known in the art (Gustin, K. Virology 1993, 193, 
-653-660rGoedd'erefai:u:s; Patent 5^374087^^ PCT publication W092/15679; Bass 

et al. Proteins: Structure, Function and Genetics 1990, 8, 309-3 14;Cunningham, B.C. Science 1990, 
247, 1461-1465; Lowman, H.B. Biochemistry 1991, 30, 10832-10838; Fowlkes et al. U.S. Patent No. 
5,789,184; Houghton, Proa Natl. Acad. Sci. U.S.A. 1985, 82, 5131-5135) for generating and 
screening peptide libraries. 

U.S. patent application 09/174,943, filed October 19, 1998, discloses a method for isolating 
biologically active peptides. Using the techniques disclosed therein, a chimeric polypeptide of the 
present invention may be developed which interacts with a chosen receptor. 

In a representative example, this method is utilized to identify polypeptides which have 
antiproliferative activity with respect to one or more types of cells. One of skill in the art will readily 
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be able to modify the procedures outlined below to find polypeptides with any desired activity. In the 
example, in the display mode, the chimeric polypeptide library can be panned with the target cells for 
which an antiproliferative is desired in order to enrich for polypeptides which bind to that cell. At that 
stage, the polypeptide library can also be panned against one or more control cell lines in order to 
remove polypeptides which bind the control cells. In this manner, the polypeptide library which is 
then tested in the secretion mode can be enriched for polypeptides which selectively bind target cells 
(relative to the control cells). Thus, for example, the display mode can produce a polypeptide library 
enriched for polypeptides which preferentially bind tumor cells relative to normal cells, which 
preferentially bind p53- cells relative to p53+ cells, which preferentially bind hair follicle cells 
relative to other epithelial cells, or any other differential binding characteristic. 

In the secretion mode, the polypeptides are tested for antiproliferative activity against the 
target cell using any of a number of techniques known in the art. For instance, BrdU or other 
nucleotide uptake can be measured as an indicator of proliferation. As above, the secretion mode can 
include negative controls in order to select for polypeptides with specific antiproliferative activity. 

In similar fashion, polypeptides can be isolated from the library based on their ability to 
induce apoptosis or cell lysis, for example, in a cell-selective manner. 

Also, this method can be used to identify polypeptides with angiogenic or antiangiogenic 
activity. For instance, the polypeptide library can be enriched for polypeptides that bind to endothelial 
cells but which do not bind to fibroblasts. The resulting sub-library can be screened for polypeptides 
which inhibit capillary endothelial cell proliferation and/or endothelial cell migration. Polypeptides 
scoring positive for one or both of these activities can also be tested for activity against other cell 
types, such as smooth muscle cells or fibroblasts, in order to select polypeptides active only against 
endothelial cells. 

Furthermore, this method can be used to identify anti-infective polypeptides, for example, 
which are active as anti-fungal or antibacterial agents. 

In addition, this assay can be used for identifying effectors of a receptor protein or complex 
thereof. In general, the assay is characterized by the use of a test cell which includes a target receptor 
or ion channel protein whose signal transduction activity can be modulated by interaction with an 
extracellular signal, the transduction activity being able to generate a detectable signal. 
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In general, such assays are characterized by the use of a mixture of cells expressing a target 
receptor protein or ion channel capable of transducing a detectable signal in the reagent cell. The 
receptor/channel protein can be either endogenous or heterologous. In combination with the disclosed 
detection means, a culture of the instant reagent cells will provide means for detecting agonists or 
antagonists of receptor function. 

The ability of particular polypeptides to modulate a signal transduction activity of the target 
receptor or channel can be scored for by detecting up or down-regulation of the detection signal. For 
example, second messenger generation (e.g., GTPase activity, phospholipid hydrolysis, or protein 
phosphorylation patterns as examples) can be measured directly. Alternatively, the use of an indicator 
gene can provide a convenient readout. In other embodiments a detection means consists of an 
indicator gene. In any event, a statistically significant change in the detection signal can be used to 
facilitate identification of compounds which modulate receptor or ion channel activities. 

By this method, polypeptides which induce a signal pathway from a particular receptor or 
channel can be identified. If a test polypeptide does not appear to induce the activity of the 
receptor/channel protein, the assay may be repeated as described above, and modified by the 
introduction of a step in which the reagent cell is first contacted with a known activator of the target 
receptor/channel to induce signal transduction, and the test peptide can be assayed for its ability to 
inhibit the activated receptor/channel, for example, to identify antagonists. In yet other embodiments, 
peptides can be screened for those which potentiate the response Jo_a toow^tiyatorof the receptor.. 

In particular, the assays can be used to test functional ligand-receptor or ligand-ion channel 
interactions for cell surface-localized receptors and channels. As described in more detail below, the 
subject assay can be used to identify effectors of, for example, G protein-coupled receptors, receptor 
tyrosine kinases, cytokine receptors, and ion channels. In certain embodiments the method described 
herein is used for identifying ligands for "orphan receptors 11 for which no ligand is known. 

In some examples, the receptor is a cell surface receptor, such as: a receptor tyrosine kinase, 
for example, an EPH receptor; an ion channel; a cytokine receptor; an multisubunit immune 
recognition receptor, a chemokine receptor; a growth factor receptor, or a G-protein coupled receptor, 
such as a chemoattracttractant peptide receptor, a neuropeptide receptor, a light receptor, a 
neurotransmitter receptor, or a polypeptide hormone receptor. 



-24- 



Preferred G protein-coupled receptors include a 1 A- adrenergic receptor, alB-adrenergic 
receptor, a2-adrenergic receptor, a2B-adrenergic receptor, 1 -adrenergic receptor, (32-adrenergic 
receptor, (33 -adrenergic receptor, ml acetylcholine receptor (AChR), m2 AChR, m3 AChR, m4 
AChR, m5 AChR, Dl dopamine receptor, D2 dopamine receptor, D3 dopamine receptor, D4 
dopamine receptor, D5 dopamine receptor, Al adenosine receptor, A2b adenosine receptor, 5-HTla 
receptor, 5-HTlb receptor, 5HTl-like receptor, 5-HTld receptor, 5HTld-like receptor, 5HTld beta 
receptor, substance K (neurokinin A) receptor, fMLP receptor, fMLP-like receptor, angiotensin II type 
1 receptor, endothelin ETA receptor, endothelin ETB receptor, thrombin receptor, growth hormone- 
releasing hormone (GHRH) receptor, vasoactive intestinal peptide receptor, oxytocin receptor, 
somatostatin SSTR1 and SSTR2, SSTR3, cannabihoid receptor, follicle stimulating hormone (FSH) 
receptor, leutropin (LH/HCG) receptor, thyroid stimulating hormone (TSH) receptor, thromboxane 
A2 receptor, platelet-activating factor (PAF) receptor, C5a anaphylatoxin receptor, Interleukin 8 (IL- 
8) IL-8RA, IL-8RB, Delta Opioid receptor, Kappa Opioid receptor, mip-l/RANTES receptor, 
Rhodopsin, Red opsin, Green opsin, Blue opsin, metabptropic glutamate mGluRl-6, histamine H2 
receptor, ATP receptor, neuropeptide Y receptor, amyloid protein precursor receptor, insulin-like 
growth factor II receptor, bradykinin receptor, gonadotropin-releasing hormone receptor, 
cholecystokinin receptor, melanocyte stimulating hormone receptor receptor, antidiuretic hormone 
receptor, glucagon receptor, and adrenocorticotropic hormone II receptor. 

Preferred EPH receptors inlcude eph, elk, eck, sek^mek4 r h^ 9 JiekZ 9 jeek 9 ..erk 9 -tyroJ^^o4f- 
fyrd5^iyro6~tyroll 9 cek4 9 cek5 9 cek6, cek7 9 cek8, cek9 9 ceklO, bsk 9 rtkl 9 rtk2 f rtk3 9 mykl 9 myk2 9 ehkl 9 
ehk2 9 pagliaccio 9 htk 9 erk and nuk receptors. 

A Cytokine Receptors 

In one example the target receptor is a cytokine receptor. Cytokines are a family of soluble 
mediators of cell-to-cell communication that includes interleukins, interferons, and colony- 
stimulating factors. The characteristic features of cytokines lie in their functional redundancy and 
pleiotropy. Most of the cytokine receptors that constitute distinct superfamilies do not possess 
intrinsic protein tyrosine kinase domains, yet receptor stimulation usually invokes rapid tyrosine 
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phosphorylation of intracellular proteins, including the receptors themselves. Many members of the 
cytokine receptor superfamily activate the Jak protein tyrosine kinase family, with resultant 
phosphorylation of the STAT transcriptional activator factors. IL-2, EL-7, EL-2 and Interferon y have 
all been shown to activate Jak kinases (Frank et al (1995) Proc Natl Acad Sci USA 92:7779-7783); 
Scharfe et al. (1995) Blood 86:2077-2085); (Bacon et al. (1995) Proc Natl Acad Sci USA 92:7307- 
7311); and (Sakatsume et al (1995) 7. Biol Chem 270:17528-17534). Events downstream of Jak 
phosphorylation have also been elucidated. For example, exposure of T lymphocytes to IL-2 has been 
shown to lead to the phosphorylation of signal transducers and activators of transcription (STAT) 
proteins STAT la, STAT2p, and STAT3, as well as of two STAT-related proteins, p94 and p95. The 
STAT proteins were found to translocate to the nucleus and to bind to a specific DNA sequence, thus 
suggesting a mechanism by which IL-2 may activate specific genes involved in immune cell function 
(Frank et al. supra). Jak3 is associated with the gamma chain of the IL-2, IL-4, and IL-7 cytokine 
receptors (Fujii et al. (1995) Proc Natl Acad Sci 92:5482-5486) and (Musso et al (1995) J Exp Med. 
181:1425-1431). The Jak kinases have also been shown to be activated by numerous ligands that 
signal via cytokine receptors such as, growth hormone and erythropoietin and IL-6 (Kishimoto (1994) 
Stem cells Suppl 12:37-44). 

Detection means which may be scored for in the present assay, in addition to direct detection 
of second messengers, such as by changes in phosphorylation, includes reporter constructs or 
indicator genes which include transcriptional regulatory elements responsiyeto jthe STAXproteins. ._ 
~ Described infra. ~ "7 

B Multisubunit Immune Recognition Receptor (MIRR). 

In another example the receptor is a multisubunit receptor. Receptors can be comprised of 
multiple proteins referred to as subunits, one category of which is referred to as a multisubunit 
receptor is a multisubunit immune recognition receptor (MIRR). MIRRs include receptors having 
multiple noncovalently associated subunits and are capable of interacting with src-family tyrosine 
kinases. MIRRs can include, but are not limited to, B cell antigen receptors, T cell antigen receptors, 
Fc receptors and CD22. One example of an MIRR is an antigen receptor on the surface of a B cell. 
To further illustrate, the MIRR on the surface of a B cell comprises membrane-bound 
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immunoglobulin (mlg) associated with the subunits Ig-a and Ig- or Ig-y, which forms a complex 
capable of regulating B cell function when bound by antigen. An antigen receptor can be functionally 
linked to an amplifier molecule in a manner such that the amplifier molecule is capable of regulating 
gene transcription. 

Src-family tyrosine kinases are enzymes capable of phosphorylating tyrosine residues of a 
target molecule. Typically, a src-family tyrosine kinase contains one or more binding domains and 
a kinase domain. A binding domain of a src-family tyrosine kinase is capable of binding to a target 
molecule and a kinase domain is capable of phosphorylating a target molecule bound to the kinase. 
Members of the src family of tyrosine kinases are characterized by an N-terminal unique region 
followed by three regions that contain different degrees of homology among all the members of the 
family. These three regions are referred to as src homology region 1 (SHI), src homology region 2 
(SH2) and src homology region 3 (SH3). Both the SH2 and SH3 domains are believed to have protein 
association functions important for the formation of signal transduction complexes. The amino acid 
sequence of an N-terminal unique region, varies between each src-family tyrosine kinase. An N- 
terminal unique region can be at least about the first 40 amino acid residues of the N-terminal of a 
src-family tyrosine kinase. 

Syk-family kinases are enzymes capable of phosphorylating tyrosine residues of a target 
molecule. Typically, a syk-family kinase contains one or more binding domains and a kinase domain. 
A binding dom ain of a syk-familyjyro sine kinase _is_capable otbinding- to a target-molecule anda 
kinase domain is capable of phosphorylating a target molecule bound to the kinase. Members of the 
syk family of tyrosine kinases are characterized by two SH2 domains for protein association function 
and a tyrosine kinase domain. 

A primary target molecule is capable of further extending a signal transduction pathway by 
modifying a second messenger molecule. Primary target molecules can include, but are not limited 
to, phosphatidylinositol 3-kinase (PI-3K), P21 ras GAPase-activating protein and associated P190 and 
P62 protein, phospholipases such as PLCyl and PLC 2, MAP kinase, She and VAV. A primary target 
molecule is capable of producing second messenger molecule which is capable of further amplifying 
a transduced signal. Second messenger molecules include, but are not limited to diacylglycerol and 
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inositol 1,4,5-triphosphate (IP3). Second messenger molecules are capable of initiating physiological 
events which can lead to alterations in gene transcription. For example, production of IP3 can result 
in release of intracellular calcium, which can then lead to activation of calmodulin kinase n, which 
can then lead to serine phosphorylation of a DNA binding protein referred to as ets-1 proto-onco- 
protein. Diacylglycerol is capable of activating the signal transduction protein, protein kinase C which 
affects the activity of the API DNA binding protein complex. Signal transduction pathways can lead 
to transcriptional activation of genes such as c-fos, egr-1, and c-myc. 

She can be thought of as an adaptor molecule. An adaptor molecule comprises a protein that 
enables two other proteins to form a complex (e.g., a three molecule complex). She protein enables 
a complex to form which includes Grb2 and SOS. She comprises an SH2 domain that is capable of 
associating with the SH2 domain of Grb2. 

Molecules of a signal transduction pathway can associate with one another using recognition 
sequences. Recognition sequences enable specific binding between two molecules. Recognition 
sequences can vary depending upon the structure of the molecules that are associating with one 
another. A molecule can have one or more recognition sequences, and as such can associate with one 
or more different molecules. 

Signal transduction pathways for MIRR complexes are capable of regulating the biological 
functions of a cell. Such functions can include, but are not limited to the ability of a cell to grow, to 
differentiate and to secrete cellular products. MIRR-induced signal transduction pathways can 
-regulate the~biotog^ of cells involved in particular responses by an 

animal, such as immune responses, inflammatory responses and allergic responses. Cells involved 
in an immune response can include, for example, B cells, T cells, macrophages, dendritic cells, 
natural killer cells and plasma cells. Cells involved in inflammatory responses can include, for 
example, basophils, mast cells, eosinophils, neutrophils and macrophages. Cells involved in allergic 
responses can include, for example mast cells, basophils, B cells, T cells and macrophages. 

In certain examples, the detection signal is a second messenger, such as a phosphorylated src- 
like protein, including reporter constructs or indicator genes which include transcriptional regulatory 
elements such as serum response element (SRE), 12-0-tetradecanoyl-phorbol-13-acetate response 
element, cyclic AMP response element, c- fos promoter, or a CREB-responsive element. 
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C. Receptor tyrosine kinases . 

In still another example, the target receptor is a receptor tyrosine kinase. The receptor tyrosine 
kinases can be divided into five subgroups on the basis of structural similarities in their extracellular 
domains and the organization of the tyrosine kinase catalytic region in their cytoplasmic domains. 
Sub-groups I (epidermal growth factor (EGF) receptor-like), II (insulin receptor-like) and the eph/eck 
family contain cysteine-rich sequences (Hirai et al., (1987) Science 238:1717-1720 and Lindberg and 
Hunter, (1990) Mol Cell Biol 10:6316-6324). The functional domains of the kinase region of these 
three classes of receptor tyrosine kinases are encoded as a contiguous sequence ( Hanks et al. (1988) 
Science 241:42-52). Subgroups HI (platelet-derived growth factor (PDGF) receptor-like) and IV (the 
fibro-blast growth factor (FGF) receptors) are characterized as having immunoglobulin (Ig)-like folds 
in their extracellular domains, as well as having their kinase domains divided in two parts by a 

3 variable stretch of unrelated amino acids (Yanden and Ullrich (1988) supra and Hanks et al. (1988) 

.=j? supra). 

4* The family with by far the largest number of known members is the EPH family. Since the 

Lh description of the prototype, the EPH receptor (Hirai et al. ( 1 987) Science 238 : 1 7 1 7- 1 720), sequences 
have been reported for at least ten members of this family, not counting apparently orthologous 
Cj receptors found in more than one species. Additional partial sequences, and the rate at which new 
yh members are still being reported, suggest the family is even larger (Maisonpierre et al. (1993)^ 

„|? 0nc0ge«e-8:3277^ Henkemeyer et al. (1994) 

H Oncogene 9:1001-1014; Ruiz et al. (1994) Mech Dev 46:87-100; Xu et al. (1994) Development 
120:287-299; Zhou et al. (1994) J Neurosci Res 37:129-143; and references in Tuzi and Gullick 
(1994) Br J Cancer 69:417-421). Remarkably, despite the large number of members in the EPH 
family, all of these molecules were identified as orphan receptors without known ligands. 

The expression patterns determined for some of the EPH family receptors have implied 
important roles for these molecules in early vertebrate development. In particular, the timing and 
pattern of expression of sek 9 mek4 and some of the other receptors during the phase of gastrulation 
and early organogenesis has suggested functions for these receptors in the important cellular 
interactions involved in patterning the embryo at this stage (Gilardi-Hebenstreit et al. (1992) 
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Oncogene 7:2499-2506; Nieto et al. (1992) Development 1 16:1 137-1 150; Henkemeyer et al., supra; 
Ruiz et al., supra; and Xu et al., supra). Sek, for example, shows a notable early expression in the two 
areas of the mouse embryo that show obvious segmentation, namely the somites in the mesoderm and 
the rhombomeres of the hindbrain; hence the name sek, for segmentally expressed kinase (Gilardi- 
Hebenstreit et al, supra; Nieto et al., supra). As in Drosophila, these segmental structures of the 
mammalian embryo are implicated as important elements in establishing the body plan. The 
observation that Sek expression precedes the appearance of morphological segmentation suggests a 
role for sek in forming these segmental structures, or in determining segment-specific cell properties 
such as lineage compartmentation (Nieto et al., supra). Moreover, EPH receptors have been 
implicated, by their pattern of expression, in the development and maintenance of nearly every tissue 
in the embryonic and adult body. For instance, EPH receptors have been detected throughout the 
nervous system, the testes, the cartilaginous model of the skeleton, tooth primordia, the infundibular 
component of the pituitary, various epithelial tissues, lung, pancreas, liver and kidney tissues. 
Observations such as this have been indicative of important and unique roles for EPH family kinases 
in development and physiology, but further progress in understanding their action has been severely 
limited by the lack of information on their ligands. 

As used herein, the terms "EPH receptor" or "EPH-type receptor" refer to a class of receptor 
tyrosine kinases, comprising at least eleven paralogous genes, though many more orthologs exist 
within this class, e.g., homologs from different species. EPH receptors, in general, are a discrete 
group of receptors "related by homology and easily recognizable, for example, they are typically 
characterized by an extracellular domain containing a characteristic spacing of cysteine residues near 
the N-terminus and two fibronectin type HI repeats (Hirai et al. (1987) Science 238:1717-1720; 
Lindberg et al. (1990) Mol Cell Biol 10:6316-6324; Chan et al. (1991) Oncogene 6:1057-1061; 
Maisonpierre et al. (1993) Oncogene 8:3277-3288; Andres et al. (1994) Oncogene 9:1461-1467; 
Henkemeyer et al. (1994) Oncogene 9:1001-1014; Ruiz et al. (1994) Mech Dev 46:87-100; Xu et al. 
(1994) Development 120:287-299; Zhou et al. (1994) JNeurosci Res 37:129-143; and references in 
Tuzi and Gullick (1994) Br J Cancer 69:417-421). Exemplary EPH receptors include the eph, elk, 
eck, sek, mek4, hek, hek2, eek, erk, tyro I, tyro4, tyroS, tyro6, tyroll, cek4, cekS, cek6, cek7, cek8, 
cek9, ceklO, bsk, rtkl, rtk2, rtk3, mykl, myk2, ehkl, ehk2,pagliaccio, htk, erk and nuk receptors. The 
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term "EPH receptor" refers to the membrane form of the receptor protein, as well as soluble 
extracellular fragments which retain the ability to bind the ligand of the present invention. 

In certain examples, the detection signal is provided by detecting phosphorylation of 
intracellular proteins, e.g., MEKKs, MEKs, or Map kinases, or by the use of reporter constructs or 
indicator genes which include transcriptional regulatory elements responsive to c-fos and/or c-jun. 
Described infra, 

D. G Protein-Coupled Receptors. 

One family of signal transduction cascades found in eukaryotic cells utilizes heterotrimeric 
"G proteins." Many different G proteins are known to interact with receptors. G protein signaling 
systems include three components: the receptor itself, a GTP-binding protein (G protein), and an 
intracellular target protein. 

The cell membrane acts as a switchboard. Messages arriving through different receptors can 
produce a single effect if the receptors act on the same type of G protein. On the other hand, signals 
activating a single receptor can produce more than one effect if the receptor acts on different kinds 
of G proteins, or if the G proteins can act on different effectors. 

In their resting state, the G proteins, which consist of alpha (a), beta (P) and gamma (y) 
subunits, are complexed with the nucleotide guanosine diphosphate (GDP) and are in contact with 
receptors. When a hormone or other first messenger binds to receptor,_the jeceptor^changes 
conformation and this alters its interaction with the G protein. This spurs the a subunit to release 
GDP, and the more abundant nucleotide guanosine triphosphate (GTP), replaces it, activating the G 
protein. The G protein then dissociates to separate the a subunit from the still complexed beta and 
gamma subunits. Either the Ga subunit, or the Gpy complex, depending on the pathway, interacts 
with an effector. The effector (which is often an enzyme) in turn converts an inactive precursor 
molecule into an active "second messenger," which may diffuse through the cytoplasm, triggering a 
metabolic cascade. After a few seconds, the Ga converts the GTP to GDP, thereby inactivating itself. 
The inactivated Ga may then reassociate with the Gpy complex. 

Hundreds, if not thousands, of receptors convey messages through heterotrimeric G proteins, 
of which at least 17 distinct forms have been isolated. Although the greatest variability has been seen 
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in the a subunit, several different (5 and y structures have been reported. There are, additionally, 
several different G protein-dependent effectors. 

Most G protein-coupled receptors are comprised of a single protein chain that is threaded 
through the plasma membrane seven times. Such receptors are often referred to as seven- 
transmembrane receptors (STRs). More than a hundred different STRs have been found, including 
many distinct receptors that bind the same ligand, and there are likely many more STRs awaiting 
discovery. 

In addition, STRs have been identified for which the natural ligands are unknown; these 
receptors are termed "orphan" G protein-coupled receptors, as described above. Examples include 
receptors cloned by Neote et al. (1993) Cell 72, 415; Kouba et al. FEBS Lett (1993) 321, 173; 
Birkenbach et al.(l 993)7. Virol 67,2209. 

The 'exogenous receptors' of this example may be any G protein-coupled receptor which is 
exogenous to the cell which is to be genetically engineered for the purpose of the present invention. 
This receptor may be a plant or animal cell receptor. Screening for binding to plant cell receptors may 
be useful in the development of, for example, herbicides. In the case of an animal receptor, it may be 
of invertebrate or vertebrate origin. If an invertebrate receptor, an insect receptor is preferred, and 
would facilitate development of insecticides. The receptor may also be a vertebrate, more preferably 
a mammalian, still more preferably a human, receptor. The exogenous receptor is also preferably a 
seven transmembrane segment receptor. 

Known ligands for G protein coupled receptors include: purines and nucleotides, such as 
adenosine, cAMP, ATP, UTP, ADP, melatonin and the like; biogenic amines (and related natural 
ligands), such as 5-hydroxytryptamine, acetylcholine, dopamine, adrenaline, adrenaline, adrenaline., 
histamine, noradrenaline, noradrenaline, noradrenaline., tyramine/octopamine and other related 
compounds; peptides such as adrenocorticotrophic hormone (acth), melanocyte stimulating hormone 
(msh), melanocortins, neurotensin (nt), bombesin and related peptides, endothelins, cholecystokinin, 
gastrin, neurokinin b (nk3), invertebrate tachykinin-like peptides, substance k (nk2), substance p 
(nkl), neuropeptide y (npy), thyrotropin releasing-factor (trf), bradykinin, angiotensin ii, beta- 
endorphin, c5a anaphalatoxin, calcitonin, chemokines (also called intercrines), corticotrophic 
releasing factor (erf), dynorphin, endorphin, fmlp and other formylated peptides, follitropin (fsh), 
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fungal mating pheremones, galanin, gastric inhibitory polypeptide receptor (gip), glucagon-like 
peptides (glps), glucagon, gonadotropin releasing hormone (gnrh), growth hormone releasing 
hormone(ghrh), insect diuretic hormone, interleukin-8, leutropin (lh/hcg), met-enkephalin, opioid 
peptides, oxytocin, parathyroid hormone (pth) and pthrp, pituitary adenylyl cyclase activiating peptide 
(pacap), secretin, somatostatin, thrombin, thyrotropin (tsh), vasoactive intestinal peptide (vip), 
vasopressin, vasotocin; eicosanoids such as ip-prostacyclin, pg-prostaglandins, tx-thromboxanes; 
retinal based compounds such as vertebrate 1 1-cis retinal, invertebrate 1 1-cis retinal and other related 
compounds; lipids and lipid-based compounds such as cannabinoids, anandamide, lysophosphatidic 
acid, platelet activating factor, leukotrienes and the like; excitatory amino acids and ions such as 
calcium ions and glutamate. 

Suitable examples of G-protein coupled receptors include, but are not limited to, 
dopaminergic, muscarinic cholinergic, a-adrenergic, b-adrenergic, opioid (including delta and mu), 
cannabinoid, serotoninergic, and GABAergic receptors. Preferred receptors include the 5HT family 
of receptors, dopamine receptors, C5a receptor and FPRL-1 receptor, cyclo-histidyl-proline- 
diketoplperazine receptors, melanocyte stimulating hormone release inhibiting factor receptor, and 
receptors for neurotensin, thyrotropin releasing hormone, calcitonin, cholecytokinin-A, neurokinin-2, 
histamine-3, cannabinoid, melanocortin, or adrenomodulin, neuropeptide- Yl or galanin. Other 
suitable receptors are listed in the art. The term 'receptor,' as used herein, encompasses both naturally 
occurring and mutant receptors. 

Manyof these G protein-coupled receptors, like the yeast a- and a-factor receptors, contain 
seven hydrophobic amino acid-rich regions which are assumed to lie within the plasma membrane. 
Specific human G protein-coupled STRs for which genes have been isolated and for which expression 
vectors could be constructed include those listed herein and others known in the art. Thus, the gene 
would be operably linked to a promoter functional in the cell to be engineered and to a signal 
sequence that also functions in the cell. For example in the case of yeast, suitable promoters include 
Ste2 . Ste3 and gal 10 . Suitable signal sequences include those of Ste2, Ste3 and of other genes which 
encode proteins secreted by yeast cells. Preferably, when a yeast cell is used, the codons of the gene 
would be optimized for expression in yeast. See Hoekema et al.,(1987) Mol Cell Biol, 7:2914-24; 
Sharp, et al., (1986)14:5125-43. 
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The homology of STRs is discussed in Dohlman et al., Ann. Rev, Biochem., (1991) 60:653-88. 
When STRs are compared, a distinct spatial pattern of homology is discernible. The transmembrane 
domains are often the most similar, whereas the N- and C -terminal regions, and the cytoplasmic loop 
connecting transmembrane segments V and VI are more divergent. 

The functional significance of different STR regions has been studied by introducing point 
mutations (both substitutions and deletions) and by constructing chimeras of different but related 
STRs. Synthetic peptides corresponding to individual segments have also been tested for activity. 
Affinity labeling has been used to identify ligand binding sites. 

It is conceivable that when the host cell is a yeast cell, a foreign receptor will fail to 
functionally integrate into the yeast membrane, and there interact with the endogenous yeast G 
protein. More likely, either the receptor will need to be modified (e.g., by replacing its V-VI loop with 
that of the yeast STE2 or STE3 receptor), or a compatible G protein should be provided; 

If the wild-type exogenous G protein-coupled receptor cannot be made functional in yeast, 
it may be mutated for this purpose. A comparison would be made of the amino acid sequences of the 
exogenous receptor and of the yeast receptors, and regions of high and low homology identified. Trial 
mutations would then be made to distinguish regions involved in ligand or G protein binding, from 
those necessary for functional integration in the membrane. The exogenous receptor would then be 
mutated in the latter region to more closely resemble the yeast receptor, until functional integration 
was achieved. If this were insufficient to achieve functionality, mutations would next be made in the 
^"regions involved in'G profein'bindin^ Mutations would be made in regions involved in ligand 
binding only as a last resort, and then an effort would be made to preserve ligand binding by making 
conservative substitutions whenever possible. 

Preferably, the yeast genome is modified so that it is unable to produce the yeast receptors 
which are homologous to the exogenous receptors in functional form. Otherwise, a positive assay 
score might reflect the ability of a peptide to activate the endogenous G protein-coupled receptor, and 
not the receptor of interest. 

(i). Chemoattractant receptors 
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The N-formyl peptide receptor is a classic example of a calcium mobilizing G protein-coupled 
receptor expressed by neutrophils and other phagocytic cells of the mammalian immune system 
(Snyderman et al. (1988) In Inflammation: Basic Principles and Clinical Correlates, pp. 309-323). 
N-Formyl peptides of bacterial origin bind to the receptor and engage a complex activation program 
that results in directed cell movement, release of inflammatory granule contents, and activation of a 
latent NADPH oxidase which is important for the production of metabolites of molecular oxygen. 
This pathway initiated by receptor-ligand interaction is critical in host protection from pyogenic 
infections. Similar signal transduction occurs in response to the inflammatory peptides C5a and BL-8. 

Two other formyl peptide receptor like (FPRL) genes have been cloned based on their ability 
to hybridize to a fragment of the NFPR cDNA coding sequence. These have been named FPRL1 
(Murphy et al. (1992) 1 Biol Chem. 261:1 '631 '-76 43) and FPRL2 (Ye et al. (1992) Biochem Biophys 
Res. Comm. 184:582-589). FPRL2 was found to mediate calcium mobilization in mouse fibroblasts 
transfected with the gene and exposed to formyl peptide. In contrast, although FPRL1 was found to 
be 69% identical in amino acid sequence to NFPR, it did not bind prototype N-formyl peptides 
ligands when expressed in heterologous cell types. This lead to the hypothesis of the existence of an 
as yet unidentified ligand for the FPRL1 orphan receptor (Murphy et al. supra). 

(ii.) G proteins 

In the case of an exogenous Gprotein-coupled receptor, the yeast cell must be able to produce 
—a G protein-which is activated by"the"exogenous receptor, and which can in turn activate the yeast 
effector(s). The art suggests that the endogenous yeast Got subunit (e.g., GPA) will be often be 
sufficiently homologous to the "cognate" Got subunit which is natively associated with the exogenous 
receptor for coupling to occur. More likely, it will be necessary to genetically engineer the yeast cell 
to produce a foreign Got subunit which can properly interact with the exogenous receptor. For 
example, the Got subunit of the yeast G protein may be replaced by the Got subunit natively associated 
with the exogenous receptor. 

Dietzel and Kurjan, (1987) Cell 50:1001) demonstrated that rat Gas functionally coupled to 
the yeast GfJy complex. However, rat Goti2 complemented only when substantially overexpressed, 
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while GaO did not complement at all. Kang, et al, Mol Cell Biol, (1990)10:2582). Consequently, 
with some foreign Ga subunits, it is not feasible to simply replace the yeast Got. 

If the exogenous G protein coupled receptor is not adequately coupled to yeast Gpy by the Ga 
subunit natively associated with the receptor, the Ga subunit may be modified to improve coupling. 
These modifications often will take the form of mutations which increase the resemblance of the Ga 
subunit to the yeast Ga while decreasing its resemblance to the receptor-associated Ga. For example, 
a residue may be changed so as to become identical to the corresponding yeast Ga residue, or to at 
least belong to the same exchange group of that residue. After modification, the modified Ga subunit 
might or might not be "substantially homologous" to the foreign and/or the yeast Ga subunit. 

The modifications are preferably concentrated in regions of the Ga which are likely to be 
involved in Gpy binding. In some examples, the modifications will take the form of replacing one or 
more segments of the receptor-associated Ga with the corresponding yeast Ga segment(s), thereby 
forming a chimeric Ga subunit. (For the purpose of the appended claims, the term "segment" refers 
to three or more consecutive amino acids.) In other examples, point mutations may be sufficient. 

This chimeric Ga subunit will interact with the exogenous receptor and the yeast Gpy 
complex, thereby permitting signal transduction. While use of the endogenous yeast Gpy is preferred, 
if a foreign or chimeric Gpy is capable of transducing the signal to the yeast effector, it may be used 
instead. 

Although-many of the techniques presented abo'verequire specific knowledge of a receptor 

active in a particular biological pathway, it will be recognized by those skilled in the art that such 
knowledge is not required for the screening of a library of chimeric polypeptides of the present 
invention. Rather, cell-based assays are well known in the art in which cells of a selected phenotype 
can be used to screen chimeric polypeptides for those which induce a particular alteration in the 
phenotype. In this way, chimeric polypeptides can be found that have a desired biological function 
that is not understood on a molecular level. 

Exemplification 
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The invention now being generally described, it will be more readily understood by reference 
to the following examples which are included merely for purposes of illustration of certain aspects 
and embodiments of the present invention, and are not intended to limit the invention. 

Serum albumin loop regions. A space-filling model of human serum albumin (HSA) is 
shown in Figure 1. The tertiary structure of HSA reveals the presence often approximate helical 
regions or loops, each constrained by disulfide bonded cysteine pairs. The space-filling model was 
used to predict loop regions that are exposed on the surface of the protein. Two amino acid segments 
were chosen to represent surface exposed regions (loop 53-62 and loop 360-369) and a third to 
represent a region assumed to be buried within the protein (loop 450-463). These and other candidate 
loops (Cys 53 -Cys^^ -Cyi 71 , 

Cys 476 -Cys 487 , and Cys 558 -Cys 567 ) are depicted in Figures 4A-L 

Myc epitope display in MSA loop regions. In order to determine whether the predicted loops 
were indeed exposed on the surface of the Sdbumin molecule, mouse serum albumin (MSA) was 
modified to include the myc epitope, EQKLISH3DL. The myc epitope was inserted in the middle of 
each of three amino acid segments: between amino acids 57-58 for loop 53-62, amino acids 364-365 
for loop 360-369 and amino acids 467-468 for loop 450-467. Cos7 cells were transfected with either 
wild type MSA or the various myc containing MSA constructs. The presence of the proteins in the 
■""medium was fircUiSermined by Western blot analysis using antibodies specific for MSA and the myc 
epitope. As can be seen in the left half of Figure 2, onw samples from media from cells transfected 
with MSA or MSA-Myc reveal the presence of the albumin protein. Additionally, only the samples 
from cells transfected with MSA-Myc are positive foAthe myc epitope. As the samples are all 
denatured by virtue of the SDS-PAGE system, this analysis does not allow for the differentiation of 
myc epitopes that would be exposed on the surface versus one that was buried within the protein. For 
this analysis immunoprecipitation with the myc-specific antioody was utilized. In this experiment, 
the conditioned media was either mixed directly with the antirody (N, native) or first denatured in 
the presence of 0.1% SDS, 1 mM 6-mercapthoethanol and heat (1Q0 °C for 10 min) and then antibody 
added (D, denatured). Following immunoprecipitation the presence of the proteins that could be 
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precipitated by the myc antibody were revealed by Western blot analysis using the MSA specific 
antibody. The right panel of Figure 2 shows that\as predicted, the albumin proteins with myc inserted 
in loops 53-62 and 360-369 were bound by the myc antibody regardless of whether the protein was 
in its native or denatured form. On the other hand\ when myc was inserted in the predicted buried 
region, loop 450-463, the protein only bound thAantibody when it was first denatured. This 
experiment clearly demonstrates that loops 53-62 and 3W)-369 are exposed on the surface of the MSA 
protein and therefore good for display. Additionally, they450-463 loop is buried. 

Inhibition of bovine capill^y endothelial cells (BCE) MSA-RGD. The goal of this 
experiment was determine the functionW MSA with the RGD peptide (VRGDF) displayed on the 
surface of the protein in the loop 53-58 recion (MSA-myc-RGD). RGD was chosen, as this peptide 
can efficiently bind to avp3 integrin receptors on endothelial cells and inhibit their proliferation. 
Triplicate wells of Cos7 cells were transfected with the following constructs: MSA-myc (the myc 
epitope was added to the C-terminal tail of MSA in this iteration); MSA-myc-RGD; or pAM7-stuffer. 
These Cos7 cells were grown in the lower chamber of a Transwell® tissue culture plate with BCE 
cells in the upper chamber. To stimulate growth of the BCE cells, FGF was added to the lower 
chamber or not in the case of no FGF control and tire cells allowed to grow for 72 hours. To one set 
of wells, those with pAM7-stuffer, 6.25 jiM c-RuJD peptide was also added. Cell growth was 
determined by a Calcein-binding fluorescence assay.YThe left panel of Figure 3 is a graph of the 
— opticardensity (OD)T6r~each. The data reveals the addition of FGF results in a 2-fold stimulation of 
growth of the BCE cells. This growth was inhibited by tMe c-RGD peptide and also by the secreted 
MSA-myc-RGD protein. The right panel is a different way of looking at the same data. In this 
instance the degree of inhibition of growth is graphed V for each. The data shows that the 
MSA-Myc-RGD protein inhibited the growth of the BCE cellM?y 53% and the degree of inhibition 
was equivalent to that of the added RGD peptide. The RGD peptide displayed on the surface of the 
MSA molecule inhibited BCE cell growth as efficiently as the endogenously added free RGD peptide 
demonstrating that the peptide retains its activity in the looped orientation. 
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Inhibition of BCE and HUVEC proliferation by serum albumin-EC binding peptide 
fusions. This experiment was designed to demonstrate the inhibition of BCE and HUVEC cell 
proliferation by purified mouse serum albumin (MSA) proteins that displayed endothelial cell binding 
(EC) peptides. In the MSA-peptide fusions the peptide sequence was inserted into a cysteine 
constrained loop between amino acids 53 and 62. The proteins were produced by COS-7 cells that 
were transfected with expression plasmids that directed the synthesis and secretion of the particular 
recombinant protein. As shown in Figure 5, in the MSA-9G5, MSA-1 1B3 and MSA-RGD constructs 
the inserted peptides replaced the naturally occurring residues of MSA between cys53-cys62. In 
MSA-1H5 and MSA-myc constructs (negative control), the peptides were inserted into the loop at 
amino acid glu57. Figures 6 and 7 show the inhibitory effect of the purified proteins on the 
proliferation of BCE and HUVEC cells that were stimulated by FGF. 

Experimental design of the EC proliferation experiments 
Protein production and concentration 

COS7-L cells were transfected with protein expression constructs expressing: 

1. MSA, full-length mouse serum albumin (negative control) 

2. MSA-RGD, in which the RGD sequence (VRGDF) replaces the MSA sequence between Cys 53 
and Cys 62 \ 

3; MSA-1TB3, in~whiclTthe _ n-B3 peptideVsequence (PSTLRAQ) replaces the MSA sequence 
between Cys 53 and Cys 62 \ 

4. MSA-1H5, in which the 1-H5 peptide sequeribe (HTKQIPRHIYSA) is inserted between Glu 57 
and Ser 58 within the Cys 53 and Cys 62 loop of MSA 

5. MSA-9G5, in which the 9-G5 peptide sequence (DSHKRLK) replaces the MSA sequence between 
Cys 53 and Cys 62 \ 

6. MSA-myc, in which the Myc epitope peptide sequence (EQKLISEEDL) is inserted between Glu 
57 and Ser 58 within the Cys 53 and Cys 62 loop of MSA (negative control) 
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The transfected C0S7-L cells were cultured in defined serum-free media (VP-SFM). Each 
day for 5 days, the conditioned media were collected from the cells, centrifuged to remove dead cells 
and other cellular debris, and then frozen. The 5 days-worth of cultured media were pooled and 
concentrated 500-fold using a Centiprep-80 with a molecular weight cut-off of 50 (for MSA, 
MSA-RGD, MSA-9G5) or a molecular weight cut-off of 30 (for MSA-myc, MSA-1 1B3, MSA-1H5). 
The concentration of the albumin proteins was determined by Western blot analysis of each 
preparation using a rabbit anti-MSA antibody and using purified MSA of known concentration to 
generate a standard curve. Following development of the blot and exposure to film the 
autoradiographs were analyzed using the Gel Doc 1000 image analysis system and Molecular Analyst 
software (BioRad). 

BCE Proliferation Assays 

On day zero, bovine capillary endothelial cells (BCE) at passage 1 1 were plated in 96-well 
tissue culture plates at a density of 2 X 103 cells per well in 100 ml 5% calf serum (CS)/ DMEM 
supplemented with penicillin/streptomycin (PS). The cells were then incubated overnight in an 
atmosphere of 10 % C0 2 , 37 °C. 

On day one, the media was changed to 1 50 ml 2% CS/DMEM/PS. The albumin proteins were 
added to the first well as 8.75 ml which contains an additional 150 ml of 2% CS/DMEM/PS. 150 ml 
was then removed from this well and added to the next well resulting in a 1 :2 dilution of the protein. 
"This process~was fepeat^Toratotal of six times each in triplicate. 50 ml of 4 ng/ml FGF (final 
concentration: 1 ng/ml FGF) was then added to each well and the plates incubated as above for 72 
h. A synthetic peptide of cyclic RGD (c-RGD) at a concentration of 4.1 mM was included to serve 
as a positive control for inhibition of proliferation. Cells without addition of protein but with FGF 
added and without FGF added were included on each plate as additional controls. 

After the 72 h incubation, the media was removed, and the plates were washed twice with 
PBS and frozen at -80 °C. Proliferation of the BCE cells was assessed using the CyQUANT® cell 
proliferation assay kit according to the manufacturer's recommendations. 

Conclusions 
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The insertion of the EC binding peptides into MSA increased their inhibitory activity by 
approximately 1000-fold. The MSA-EC binding peptide fusions inhibited BCE and HUVEC 
proliferation in the nanomolar (nM) range while the synthetic peptides were active in the micromolar 
(mM) range. The control MSA and MS A-myc proteins did not significantly affect the proliferation 
of the target endothelial cells. 

Induction of tumor cell apoptosis by MSA-RGD fusions. Peptides containing the RGD 
(Arg-Gly-Asp) motif have been shown to induce apoptosis in a caspase-3 dependent manner through 
the promotion of pro-caspase3 auto-cleavage and activation (Buckley et al., 1 999). It was therefore 
of interest to determine if the MSA-RGD fusion was also capable of inducing apoptosis. To test this 
hypothesis, human non-small cell lung carcinoma cells (NCI 1869) were plated on the membrane of 
a transwell insert. These cells were incubated to allow attachment. Cos7 cells were transfected with 
a plasmid containing cDNA encoding (pcDNA MSA-RGD/53) for the expression and secretion of 
the respective fusion protein. An empty vector (pcDNA3) was transfected in parallel as a negative 
control. After 24 hours, the transwell insert carrying the NCI- 1869 cells was transferred to the plate 
containing the Cos7/MS A-RGD transfectants. The cells were co-incubated for an additional 24 hours. 
The NCI- 1869 cells were then recovered and incubated in PBS/Mg++ containing the fluorometric 
Caspase-3 substrate, DEVD-AFC. Cleavage of this fluorogenic tetrapeptide substrate by Caspase-3 
generates a fluorescent signal, which is read in a fluorometric plate reader as a measure of the 
induction of apoptosis. 

The results presented in Figure 8 (each bar is the average of 3 independent samples) 
demonstrate that the secretion of MSA-RGD by Cos7 cells leads to a 4.9 fold induction of apoptosis 
relative to the vector control in NCI- 1869 cells. Incubation of these cells with purified RGD peptide 
also leads to the induction of apoptosis as assessed by microscopic analysis. 

The skilled artisan will recognize many equivalents to the disclosed invention, all of which 
are intended to be within the scope of the present invention. All articles, patents, and applications 
cited above are incorporated herein by reference. 
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